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CLAIMS 

What is claimed is: 



1 . A methdd for identifying desirable protein backbone configurations comprising: 
5 generatmg backbone protein configurations using a set of dihedral angle pairs; 

normalizing the total surface exposure of each remaining configuration; 
generating ^random set of sequences of hydrophobicities with uniform weight on 
the space of allowed\equences; 

determining, f<Jr each randomly generated sequence, which of the remaining 
10 configurations is the ground state, and; 

recording a grounfi-state configuration for each sequence wherein the desirable 
configurations are those containing the most sequences with that configuration as their 
f j ground state. 

ry> 2. A method for identifying ^esirable protein backbone configurations as in claim 1 

wherein: 

one pair of dihedral mgj/c V^irs corresponds to an alpha helix and one pair of 
dihedral angle pairs corresponds tcmifeta strand. 

3. A method for identifying desi\able protein backbone configurations as in claim 1 

wherein: 

two sets of dihedral angles correspond to an alpha helix and one set of dihedral 
angle pairs corresponds to a beta stranc 



25 4. A method for identifying desirable prqein backbone configurations as in claim 3 

wherein: 

additional dihedral angles fall within legions of high frequency in a 
Ramachandran plot. 

30 5. A method for identifying desirable protein ^ackbone configurations as in claim 4 

wherein: 



14 



* 




the payability of choosing a particular pair of dihedral angles depends on the 
, preceedVng pairs of dihedral angles along the backbone. 

6. A method f\ identifying desirable protein backbone configurations as in claim 5 
further compr 

eliminating ^elf-intersecting configurations. 

7. A method for identifying desirable protein backbone configurations as in claim 6 
further comprising: 

eliminating non-compact configurations. 

8. A method for identifying desirable protein backbone configurations as in claim 7 
further comprising: 

clustering configurations sufficiently similar in the three dimensional trajectory 
followed by their backbones abd treating all configurations within a cluster as variants of 
a single configuration, and; 

in a cluster, the number of sequences with that 
l that the sum is considered the designability of the 



summing, for all con 
configuration as their grou; 
cluster. 



v 



9. A method for identifying desirable protein backbone configurations as in claim 8 
further comprising: 

eliminating configurations wifh low Variances. 

10. A method for identifying desirable protein backbone configurations as in claim 1 
wherein: 

the set of dihedral angle pairs is a &t of strings of dihedral angle pairs. 



11. A method for identifying desirable protean backbone configurations as in claim 1 0 
wherein: 
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the strimgs of angles are weighted according to their frequency of appearance in 
natural proteinsland infrequent strings are eliminated. 

12. A method f<\ identifying desirable protein backbone configurations as in claim 1 
wherein: 

normalizing accomplished by dividing the surface exposure of each amino acid 
in a given configuration by the total surface exposure of that configuration. 

13. A method for iden\fying desirable protein backbone configurations as in claim 1 
further comprising: 

eliminating configurations with low Variance. 

14. A method for identifying desirable protein backbone configurations as in claim 1 
further comprising: 

eliminating self-intef^ting configurations. 

15. A method for identifying |esfl[able protein backbone configurations as in claim 14 
further comprising: 

eliminating non-compact configurations. 

16. A method for identifying desirable protein backbone configurations as in claim 1 
further comprising: 

eliminating non-compact dpnfigurations. 

17. A method for identifying desirable protein backbone configurations as in claim 1 
further comprising: 

eliminating configurations with low Variance. 



18. A method for identifying desirabl^ protein backbone configurations as in claim 1 
further comprising: 
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eliminating all configurations that are not favorable for forming a large number of 
hydrogen bond\ after eliminating non-compact configurations. 

19. A method for\dentifying desirable protein backbone configurations as in claim 1 
further comprising: 

clustering configurations sufficiently similar in the three dimensional trajectory 
followed by their backWbnes and treating all configurations within a cluster as variants of 
a single configuration, an)d; 

summing, for all configurations in a cluster, the number of sequences with that 
configuration as their grounM state such that the sum is considered the designability of the 
cluster. \ 

20. A method for identifying fflesirable protein backbone configurations as in claim 19 
wherein: I \ \ 

clustering is accomplished by totaling the root-mean-square distance between 
every pair of configurations and defining a configuration as a member of a cluster if it 
lies within a root-mean-square distance X of any member of the cluster. 

21. A method for identifying desirable protein backbone configurations as in claim 20 
wherein: I 

X is 0.4 A per amino acid. 1 

22. A method for designing proteins comprising: 

generating backbone protein configurations using a set of dihedral angle pairs; 
eliminating self-intersecting configurations; 

normalizing the total surface exposure of each remaining configuration; 

generating a random set of sequences of hydrophobicities with uniform weight on 
the space of allowed sequences for each remaining configuration; 

determining, for each randomly generated sequence, which of the remaining 
configurations is the ground state; 
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recording the ground-state configuration for each sequence wherein desirable 
configurations are those containing the most sequences with that configuration as their 
ground state, and; 

synthesizing sequences of amino acids for the desirable configurations. 

23. A method for designing proteins as in claim 22 wherein: 

one set of dihedral angle pairs corresponds to an alpha helix and one set of 
dihedral angles corresponds to a beta strand. 

24. A method for designing proteins as in claim 22 wherein: 

two sets of dihedral angles correspond to an alpha helix and one set of dihedral 
angle pairs corresponds to a beta strand. 

25. A method for designing proteins as in claim 24 wherein: 

additional dihedral angle pairs fall within regions of high frequency in a 
Ramachandran plot. 

26. A method for designing proteins as in claim 25 wherein: 

the probability of choosing a particular pair of dihedral angles depends on the 
preceeding pairs of dihedral angles along the backbone. 

27. A method for designing proteins as in claim 26 further comprising: 

eliminating self-intersecting configurations. 

28. A method for designing proteins as in claim 27 further comprising: 

eliminating non-compact configurations. 

29. A method for designing proteins as in claim 28 further comprising: 
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clustering configurations sufficiently similar in the three dimensional trajectory 
followed by their backbones and treating all configurations within a cluster as variants of 
a single configuration, and; 

summing, for all configurations in a cluster, the number of sequences with that 
configuration as their ground state such that the sum is considered the designability of the 
cluster. 

30. A method for designing proteins as in claim 29 further comprising: 
recording the Variance of each configuration, ranking the configurations from 

highest Variance to lowest, and 

designing proteins starting with the configurations having the highest Variance. 

31. A method for designing proteins as in claim 22 wherein: 

the set of dihedral angles is a set of strings of dihedral angles. 

32. A method for designing proteins as in claim 3 1 wherein: 

the strings of angles are weighted according to their frequency of appearance in 
natural proteins and infrequent strings are eliminated. 

33. A method for designing proteins as in claim 22 wherein: 

normalizing is accomplished by dividing the surface exposure of each amino acid 
in a given configuration by the total surface exposure of that configuration. 

34. A method for designing proteins as in claim 22 further comprising: 
recording the Variance of each configuration, ranking the configurations from 

highest Variance to lowest, and 

designing proteins starting with the configurations having the highest Variance. 

35. A method for designing proteins as in claim 22 further comprising: 

eliminating non-compact configurations after self-intersecting configurations are 
eliminated. 
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36. A method for designing proteins as in claim 35 further comprising: 

clustering configurations sufficiently similar in the three dimensional trajectory 
followed by their backbones and treating all configurations within a cluster as variants of 
a single configuration, and; 

summing, for all configurations in a cluster, the number of sequences with that 
configuration as their ground state such that the sum is considered the designability of the 
cluster. 

37. A method for designing proteins as in claim 22 further comprising: 

eliminating all configurations that are not favorable for forming hydrogen bonds 
after eliminating non-compact configurations. 

38. A method for designing proteins as in claim 22 further comprising: 

clustering configurations sufficiently similar in the three dimensional trajectory 
followed by their backbones and treating all configurations within a cluster as variants of 
a single configuration, and; 

summing, for all configurations in a cluster, the number of sequences with that 
configuration as their ground state such that the sum is considered the designability of the 
cluster. 

39. A method for designing proteins as in claim 38 wherein: 

clustering is accomplished by totaling the root-mean-square distance between 
every pair of configurations and defining a configuration as a member of a cluster if it 
lies within a root-mean-square distance X of any member of the cluster. 

40. A method for designing proteins as in claim 39 wherein: 

X is 0.4 Angstroms per amino acid. 
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41. A methock for analyzing the designability of protein backbone configurations to 
determine if thanumber of sequences each configuration has in its ground state is larger 
than a predetermined number comprising: 

generating\ackbone protein configurations using a set of dihedral angle pairs; 

eliminating ^elf-intersecting configurations; 

normalizing tra total surface exposure of each remaining configuration; 

generating a random set of sequences of hydrophobicities with uniform weight on 
the space of allowed secmences; 

determining, for each randomly generated sequence, which of the remaining 
configurations is the grouna state; 

recording a ground-srate configuration for each sequence wherein the desirable 
configurations are those contaljfiing the most sequences with that configuration as their 
ground state, and; 

comparing how many sequences each configuration has in its ground-state with 
the predetermined number wherelW configurations with larger numbers are highly 
designable. 



42. A method for analyzing th 
claim 41 wherein: 

normalizing is accompli 



Jsifoiability of protein backbone configurations as in 



hed b\ dividing the surface exposure of each amino acid 



in a given configuration by the total sumace exposure of that configuration. 



43. A method for analyzing the designability of protein backbone configurations as in 

\t 

claim 41 wherein: 

i 1 

one set of dihedral angle pairs corresponds to an alpha helix and one set of 
< \ 

dihedral angle pairs corresponds to a beta strand. 



44. A method for analyzing the designability\>f protein backbone configurations as in 
claim 41 wherein: 

two sets of dihedral angle pairs correspoi^i to an alpha helix and one set of 
dihedral angle pairs corresponds to a beta strand. 
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45. AVnethod for analyzing the designability of protein backbone configurations as in 
claim 44 wherein: 

additional dihedral angles fall within regions of high frequency in a 
Ramachanflran plot. 

46. A method for analyzing the designability of protein backbone configurations as in 
claim 45 wher&n: 

the probability of choosing a particular pair of dihedral angles depends on the 
preceeding pairs of dihedral angles along the backbone. 

47. A method for Lialyzing the designability of protein backbone configurations as in 
claim 46 further conWising: ^ 

eliminating nob-compact configurations after self-intersecting configurations are 
eliminated. \ 

48. A method for analwrta the designability of protein backbone configurations as in 
claim 47 further compmsmg^V 

recording the \ ariabceW each configuration, ranking the configurations from 
highest Variance to lo\/est,Vna 

designing proteins starting with the configurations having the highest Variance. 

49. A method for analyzing thV designability of protein backbone configurations as in 
claim 48 further comprising: \ 

clustering configurations Sufficiently similar in the three dimensional trajectory 
followed by their backbones and treating all configurations within a cluster as variants of 
a single configuration, and; \ 

summing, for all configurations in a cluster, the number of sequences with that 
configuration as their ground state sucfo that the sum is considered the designability of the 
cluster. \ 
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50. A mVthod for analyzing the designability of protein backbone configurations as in 
claim 41 wierein: 

the sbt of dihedral angle pairs is a set of strings of dihedral angle pairs. 

51. A methocnfor analyzing the designability of protein backbone configurations as in 
claim 49 wherem: 

the stringsVf angles are weighted according to their frequency of appearance in 
natural proteins and infrequent strings are eliminated. 

52. A method for analyzing the designability of protein backbone configurations as in 
claim 41 wherein: \ 

the probability of choosing a particular pair of dihedral angles depends on the 
preceeding pairs of diheqral angles along the backbone. 

53. A method for analyziraKhe designability of protein backbone configurations as in 
claim 41 further comprisingA \ 

recording the V^ffl^clfoiWch configuration, ranking the configurations from 
highest Variance to lowest, ana \ 

designing proteins starting with the configurations having the highest Variance. 

54. A method for analyzing the designability of protein backbone configurations as in 
claim 41 further comprising: \ 

eliminating all configuration that are not favorable for forming a large number of 
hydrogen bonds after eliminating nqn-compact configurations. 

55. A method for analyzing the desigimbility of protein backbone configurations as in 
claim 41 further comprising: \ 

clustering configurations sufficiemly similar in the three dimensional trajectory 
followed by their backbones and treating a|l configurations within a cluster as variants of 
a single configuration, and; \ 
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configuration a! 
cluster. 



for all configurations in a cluster, the number of sequences with that 
.their ground state such that the sum is considered the designability of the 



56. A method for analyzing the designability of protein backbone configurations as in 
claim 55 further conpsing: 

clustering isadcoWl ished by totaling the root-mean-square distance between 
every pair of confeitfanonfe and defining a configuration as a member of a cluster if it 
lies within a root-m ^an-squ&e distance X of any member of the cluster. 

57. A method for analyzing the designability of protein backbone configurations as in 
claim 56 further comprising: 

X is 0.4 Angstroms per amino acid. 
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